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A method for transmitting images, and an image coder 

The present invention relates to a method for transmitting images 
according to the preamble of the claim 1. The present invention also 
5 relates to a device for transmitting frames according to the preamble of 
the claim 12. The present invention relates furthermore to an encoder 
according to the preamble of the claim 23, to a decoder according to 
the preamble of the claim 24 to a codec according to the preamble of 
the claim 25, to a mobile terminal according to the preamble of the 
10 claim 26, and a storage media for storing a software program according 
to the preamble of the claim 27. 

The image can be any digital image, a video image, a TV image, an 
image generated by a video recorder, a computer animation, a still 
15 image, etc. One image consists of pixels which are arranged in 
horizontal and vertical lines, and the number of which in one image is 
typically tens of thousands. In addition, the information generated for 
each pixel contains, for instance, luminance information about the pixel, 
typically with a resolution of eight bits, and in colour applications also 
20 chrominance information, e.g. a chrominance signal. This chrominance 
signal further consists of two components, Cb and Cr, which are both 
typically transmitted with a resolution of eight bits. On the basis of these 
fuminance and chrominance values, it is possible at the receiving end 
to form information corresponding to the original pixel on the display 
25 device of a video terminal. In said example, the quantity of data to be 
transmitted for each pixel is 24 bits uncompressed. Thus, the total 
amount of information for one image amounts to several megabits. In 
the transmission of a moving image, several images are transmitted per 
second, for instance in a TV image, 25 images are transmitted per 
30 second. Without compression, the quantity of information to be 
transmitted would amount to tens of megabits per second. However, for 
example in the Internet data network, the data transmission rate can be 
in the order of 64 kbits per second, which makes uncompressed real 
time image transmission via this network practically impossible. 



To reduce the amount of information to be transmitted, different 
compression methods have been developed, such as the JPEG, MPEG 
and H.263 standards. In the transmission of video, image compression 
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can be performed either as inter-frame compression, intra-frame 
compression, or a combination of these. In inter-frame compression, 
the aim is to eliminate redundant information in successive image 
frames. Typically, images contain a large amount of non-varying 
5 information, for example a motionless background, or slowly changing 
information, for example when the subject moves slowly. In inter-frame 
compression, it is also possible to utilize motion compensated 
prediction, wherein the aim is to detect elements in the image which are 
moving, wherein a motion vector and some kind of prediction error 
10 information relating to of this entity are transmitted instead of 
transmitting the pixel values representing the whole entity. Thus, the 
direction of the motion and the speed of the subject in question is 
defined, to establish this motion vector. To enable the use of image 
compression techniques, the transmitting and the receiving video 
15 terminal are required to have such a high processing speed that it is 
possible to perform compression and decompression in real time. 

In several image compression techniques, an image signal converted 
into digital format is subjected to a discrete cosine transform (DCT) 
before the image signal is transmitted to a transmission path or stored 
in a storage means. Using a DCT, it is possible to calculate the 
frequency spectrum of a periodic signal, i.e. to perform a transformation 
from the time domain to the frequency domain. In this context, the word 
discrete indicates that separate pixels instead of continuous functions 
are processed in the transformation. In a digital image signal, 
neighbouring pixels typically have a substantial spatial correlation. One 
feature of the DCT is that the coefficients established as a result of the 
DCT are practically uncorrected; hence, the DCT conducts the 
transformation of the image signal from the time domain to the 
frequency domain in an efficient manner, reducing the redundancy of 
the image data. As such, use of transform coding is an effective way of 
reducing redundancy in both inter-fram and intra-frame coding. 

Current block-based coding methods used in still image coding and 
35 video coding for independently coded key frames (intra-frames) use a 
block-based approach. In general, an image is divided into NxM blocks 
that are coded independently using some kind of transform coding. 
Pure block-based coding only reduces the interpixel correlation within 



20 



25 



30 
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one block, without considering the interblock correlation of pixels. 
Therefore, pure block-based coding produces rather high bit rates even 
when using transform-based coding, such as DCT, which has very 
efficient energy packing properties for highly correlated data. Therefore, 
5 current digital image coding standards exploit certain methods that also 
reduce the correlation of pixel values between blocks. 

Current digital image coding methods perform prediction in the 
transform domain, i.e. they try to predict the DCT coefficients of a block 

10 currently being coded using the previous coded blocks and are thus 
coupled with the compression method. Typically a DCT coefficient that 
corresponds to the average pixel value within an image block is 
predicted using the same DCT coefficient from the previous coded 
block. The difference between the actual and predicted coefficient is 

15 sent to decoder. However, this scheme can predict only the average 
pixel value, and it is not very efficient. 

Prediction of DCT coefficients can also be performed using spatially 
neighbouring blocks. For example, a DCT coefficient that corresponds 

20 to the average pixel value within a block is predicted using the DCT 
coefficient(s) from a block to the left or above the current block being 
coded. DCT coefficients that correspond to horizontal frequencies (i.e. 
vertical edges) can be predicted from the block above the current block 
and coefficients that correspond to vertical frequencies (i.e. horizontal 

25 edges) can be predicted from the block situated to the left. Similar to 
the previous method, differences between the actual and predicted 
coefficients are coded and sent to the decoder. This approach allows 
prediction of horizontal and vertical edges that run through several 
blocks. 

30 

In MPEG-2 compression, the DCT is performed in blocks so that the 
block size is 8 x 8 pixels. The luminance level is transformed using full 
spatial resolution. Both chrominance signals are subsampled, for 
example a field of 16 x 16 pixels is subsampled into a field of 8 x 8 
35 pixels. The differences in the block sizes are primarily due to the fact 
that the eye does not discern changes in chrominance equally well as 
changes in luminance, wherein a field of 2 x 2 pixels is encoded with 
the same chrominance value. 
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The MPEG-2 standard defines three frame types: an l-frame (Intra), a 
P-frame (Predicted), and a B-frame (Bidirectional). The l-frame is 
generated solely on the basis of information contained in the image 
5 itself, wherein at the receiving end, this l-frame can be used to form the 
entire image. The P-frame is formed on the basis of the closest 
preceding l-frame or P-frame f wherein at the receiving stage the 
preceding l-frame or P-frame is correspondingly used together with the 
received P-frame. In the composition of P-frames, for instance motion 

10 compensation is used to compress the quantity of information. B- 
frames are formed on the basis of the preceding l-frame and the 
following P- or l-frame. Correspondingly, at the receiving stage it is not 
possible to compose the B-frame until the preceding and following 
frames have been received. Furthermore, at the transmission stage the 

15 order of these P- and B-frames is changed, wherein the P-frame 
following the B-frame is received first. This tends to accelerate 
reconstruction of the image in the receiver. 

Intra-frame coding schemes used in prior art solutions are inefficient, 
20 wherein transmission of intra-coded frames is bandwidth -exessive. This 
limits the usage of independently coded key frames in low bit rate 
digital image coding applications. 

The system in this invention addresses the problem of how to further 
25 reduce redundant information in image data and to produce more 
efficient coding of image data by introducing a spatial prediction 
scheme involving the prediction of pixel values, that offers possibility for 
prediction from several directions. This allows efficient prediction of 
edges with different orientations, resulting in considerable savings in bit 
30 rate. The system also uses context-dependent selection of suitable 
prediction methods, which gives further savings in bit rate. 

The present invention describes a method to perform spatial prediction 
of pixel values within an image. The technical description of this 
35 document introduces a system for spatial prediction that can be used 
for block-based still image coding and for intra-frame coding in block- 
based video coders. The key elements of the invention are the use of 
multiple prediction methods and the context-dependent selection and 
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signaling of the selected prediction method. The use of multiple 
prediction methods and the context-dependent selection and signaling 
of the prediction methods allow substantial savings in bit rate to be 
achieved compared with prior art solutions. 

5 

It is the object of the present invention to improve encoding and 
decoding of digital image such that higher encoding efficiency can be 
achieved and the bit rate of the encoded digital image can be further 
reduced. 

10 

According to the present invention, this object is achieved by an 
encoder for performing spatially predicted encoding of image data. The 
present invention is primarily characterized in what is presented in the 
characterizing part of claim 1. The device for transmitting frames 

15 according to the present invention is primarily characterized in what is 
presented in the characterizing part of claim 12. The encoder according 
to the present invention is primarily characterized in what is presented 
in the characterizing part of claim 23. The decoder according to the 
present invention is primarily characterized In what is presented in the 

20 characterizing part of claim 24. The codec according to the present 
invention is primarily characterized in what is presented in the 
characterizing part of claim 25. The mobile terminal according to the 
present invention is primarily characterized in what is presented in the 
characterizing part of claim 26. The storage media for storing a 

25 software program according to the present invention is primarily 
characterized in what is presented in the characterizing part of claim 
27. The invention is based on the idea that to perform spatial prediction 
of pixel values for a block to be coded, the adjacent, decoded blocks 
are examined to find if there exists some directionality in the contents of 

30 the adjacent blocks. This directionality information is then used to 
classify those blocks. Based on the combination of the classes of the 
adjacent blocks the contents (pixel values) of the current block is then 
predicted using a suitable prediction method. The prediction method is 
then signalled to the decoder. Prediction error information is also sent if 

35 it is efficient to do that in a distortion vs. bit-rate sense. 

Considerable advantages are achieved with the present invention when 
compared with solutions of prior art. Using a method according to the 
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invention, it is possible to reduce the amount of information needed 
when transmitting images in digital format. 

In general, the method according to the invention can be applied to 
block-based still image coding as well as to intra-frame coding in a 
block-based digital image coder. 

In the following, the invention will be described in more detail with 
reference to the appended figures, in which 

Fig. 1 shows the structure of a digital image transmission system, 

Fig. 2 shows the spatial prediction method of the present invention 
as a block diagram, 

Figs. 3a — 3c show an illustration of blocks that are used for 
prediction according to an advantageous embodiment of the 
present invention, 

20 Fig. 4 shows the mapping of directionality classes to context 
classes according to an advantageous embodiment of the 
present invention, 

Figs. 5a — 5p show an illustration of pixels that are used for 
25 prediction according to an advantageous embodiment of the 

present invention, 



15 



30 



Fig. 6 shows an advantageous bitstream syntax of the 
displacement information transmission, and 

Fig. 7 is a schematic representation of a portable teleconferencing 
device implementing a method according to the invention. 

The intra-frame prediction method described in this invention works in a 
35 block-based manner and can be applied to image frames that comprise 
NxM blocks scanned e.g. row by row from left to right and from top to 
bottom. It is obvious that other scanning directions can also be used in 
connection with the present invention. Spatial prediction is performed 
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for each intra-coded block using previously reconstructed blocks in the 
same frame. The residual error can be compressed with any suitable 
method, e.g. using DCT as in current standards. 

5 The system according to the invention consists of two main parts, as 
illustrated in Figure 2. First, context-dependent selection 17 of a 
suitable subset of prediction methods is performed by classifying 
neighbouring reconstructed blocks. Second, prediction block is 
constructed 18 using one of the prediction methods in the selected 
1 0 subset and the prediction method is signalled to decoder. 

Context-dependent selection of a prediction method subset comprises 
directionality classification of possible neighbouring blocks, mapping of 
directionality classes to context classes and context-dependent 
15 selection of an appropriate prediction method subset. 

In the following, the transmission and reception of digital image frames 
in a transmission system is described with reference to the digital 
image transfer arrangement presented in Figure 1. The current frame 

20 arrives at the transmission system 1 as input data 2 provided, for 
example, as the output of a digital video camera. The current frame 
may be provided in its entirety {i.e. a complete frame comprising NxM 
image blocks), in which case the frame is stored, or the transmission 
system 1 may receive the input data block by block. The blocks of the 

25 frame are directed one by one to a summer 4, where prediction error of 
a block is calculated e.g. by subtracting blocks of the frame from 
predicted blocks. The prediction error is coded in a coder 5 and 
decoded in a decoder 6. In summer 7 decoded prediction error is 
summed with predicted blocks and the result is saved in a frame 

30 memory 8. The prediction estimator 3, where spatial prediction of a 
pixel is performed for every block, according to the method of the 
invention, receives blocks to be used with prediction from the frame 
memory 8. 

35 In order to form a new prediction block, the prediction estimator 3 
examines, if there exists some directionality in possible neighbouring 
blocks of the current block. This scheme is illustrated in Figure 3a. The 
reference C denotes the current block, the reference L denotes the first 
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neighbouring block of the current block and the reference U denotes 
the second neighbouring block of the current block. The first 
neighbouring block is to the left of the current block C and the second 
neighbouring block is above the current block C in this advantageous 
5 embodiment of the invention. If the scanning order is different from left 
to right and from top to bottom, the first neighbouring block L and the 
second neighbouring block U are not necessarily to the left of and 
above the current block C, respectively. The neighbouring blocks L, U 
are such blocks which are adjacent to the current block C and which 

10 already are reconstructed. Furthermore, in some embodiments of the 
invention there can be more than two blocks which are classified and 
used to select the prediction method for the current block C. In the 
following description of the invention, however, at most two 
neighbouring blocks L, U are classified for each block C under 

15 examination. Furthermore, the classification is performed only if a 
neighbouring block L or U exists. If a current block does not have any 
neighbouring blocks, it is treated as "Non-lntra" during the context- 
dependent selection of prediction methods, as will be explained further 
later in the text. 

20 

Prediction can also be implemented in such a way that it is performed 
using only already reconstructed intra-coded blocks. In this case, all 
blocks other than intra-coded blocks are treated as "Non-lntra". 

25 The first neighbouring block L and the second neighbouring block U are 
classified according to the directionality of image details inside the 
block. The directionality classifier 19 analyses the directionality of the 
neighbouring blocks using pixel value gradients. As a result, each 
neighbouring block is mapped 20 into an output class. In an 

30 advantageous embodiment of the invention there are 1 1 such output 
classes, but it is obvious that the number of output classes may vary. 
These output classes consist advantageously of 8 directionality classes 

DO - D7 corresponding to edge orientations k»22.5°, k = 0, 1 7 and 

3 non-directional classes D8 - D10 corresponding to flat, smooth 

35 texture and coarse texture blocks. 

In the system of figure 1 , the prediction estimator 3 first examines, if the 
first neighbouring block L and/or the second neighbouring block U exist. 
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If either one of these blocks does not exist, that neighbouring block is 
defined as a CO block ("Non-lntra"), I.e. the current block C is on the 
edge or in a corner of the frame, or on the edge or in a corner of an 
area consisting of Intra blocks. Then, the prediction estimator 3 selects 
a suitable prediction method for the current block C which is described 
later in this description. Otherwise the prediction estimator 3 calculates 
gradient information relating to the block or blocks L, U. 

There are many suitable methods for calculating the gradient 
information. In the following, one advantageous method is described. 
First, average absolute directional gradients g k , k = 0, 1, ... , 7 of a 
block L, U are defined as 



So = 



N(N-l) 
1 



■max 
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y=0 *=0 
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15 



where N is the size of the block and l(x.y) represent the pixel intensity 
values. Indices x and y refer to coordinates of pixel inside the block and 
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k represents edge orientations. The prediction estimator 3 calculates 
the gradient values g k according to the formulae above. 

Using the gradient values g k , gradient ratios r kj k = 0, 1, , 7 are 
5 defined as the ratio between the gradient value in a certain direction 
and gradient value in the orthogonal direction: 



'0 ' 


r = ■&_ 
Ss 


r 

'2 

St 




1 


1 


1 

r 2 


1 

r 7 = — 
r 3 



(2) 



Based on the absolute gradient values g k and gradient ratios r k defined 
10 in (1) and (2), classification of the block is performed advantageously 
using the following classification steps 1 — -12 with some numerical 
values as thresholds. This classification process classifies each of the 
neighbouring blocks into one of a first set of block types DO — D10. The 
present invention is not limited to the values used in the algorithm, but 
15 the values used in the algorithm in the following steps are preferred. 
The method can also be applied to any block size. 

In this advantageous embodiment of the invention the classification 
phase comprises 13 steps, but it is obvious that the classification may 
20 comprise also different number of steps. 

Step 1 

In this step the flatness of the block is checked. Prediction estimator 3 
calculates gradient values g 0 and g 4 . These correspond to gradient 
25 values for horizontal (0°) and vertical (90°) image details. If both 
g0<2.0 and g4<2.0, the block is classified as D8 and the initial 
classification process terminates. Otherwise classification step 2 is 
performed. 

30 Step 2 

In this step a further check for flatness of the block is performed. The 
rest of the gradient values g k are calculated, and the maximum gradient 
value g ma * = max{g k } is explored. The maximum gradient value g max is 
compared with 2.5. If g max < 2.5 the block is classified as D8 and the 
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initial classification process terminates. Otherwise the method 
continues from step 3. 

Step 3 

5 In step 3 a check for clear directionality is performed. The gradient 
ratios r« are calculated and the minimum gradient ratio r mfn = min{r k } is 
determined. When the minimum gradient ratio is found, the 
corresponding index k min is defined. If r min <0.15 the block is classified 
to corresponding class Dk min and the method continues from step 12, 
10 otherwise the method continues from step 4. 

Step 4 

In step. 4 a check for texture is performed. The minimum gradient ratio 
r min is compared with 0.6. If r min > 0.6 the method continues from step 
15 13, otherwise the method continues from the next step. 

Step 5 

In step 5 the two smallest gradient ratios are checked to determine if 
they are clearly distinct. The gradient ratios r k are sorted in increasing 
20 order r (0 ) < r (1) < r (2 ) ...<r (7) . Also the gradient ratio indices are 
reordered according to the sorted order k (0) , k (1) , k (2 ). ... k (7) . If 
r (» _r co) <-ki r m - r a>) tne slxtn classification step is performed next, 
otherwise the method continues from the 10th classification step. 

25 Step 6 

In step 6 the smallest gradient ratio is checked to determine if it 
corresponds to context class C2 or C6 and the smallest gradient ratio is 
small enough. The prediction estimator 3 first examines, whether the 
index of the gradient ratio r (0) is either 2 or 6, wherein the first gradient 
30 ratio r (0) is compared with 0.6. If r (0) e {r k | k = 2,6} and r (0) < 0.6, the block 

is classified as corresponding to class Dk (0 ) and the method continues 
from step 12. Otherwise the method continues from step 7. 

Step 7 

35 In step 7 the prediction estimator 3 first examines if the index of the 
second gradient ratio r (1) is either 2 or 6, wherein the first gradient ratio 
r (0) is compared with 0.6. If r 0) e {t k \k = 2,6} and r (0) < 0.6 the block is 
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classified as corresponding to class Dk (1) and the method continues 
from the step 12, otherwise the method continues from Step 8. 

Step 8 

5 In step 8 the smallest gradient ratio is checked to determine if it 
corresponds to context class C1, C3, C5 or C7 and the smallest 
gradient ratio is small enough. The first gradient ratio r (0) is compared 
with 0.5. If r (0) e [r k \ k -1,3,5,7} and r (0) < 0.5 the block is classified as 

corresponding to class Dk (0 ) and the method continues from step 12, 
10 otherwise the method continues from step 9. 

Step 9 

In step 9 the second gradient ratio is checked to determine if it 
corresponds to context class C1, C3, C5 or C7 and the smallest 
15 gradient ratio is small enough. The first gradient ratio r (0 ) is compared 
with 0.5, if r 0) e {r k \ k = i ) 3>5,7} . If r (0 > < 0.5 the block is classified as 

corresponding to class Ok^ and the method continues from step 12. 
Otherwise the method continues from step 10. 

20 Step 1 0 

Directionality is not yet found, therefore a (little bit) higher threshold 
value compared with the threshold value used in Step 3 can be tried for 
checking the directionality. This means that more uncertain examination 
is performed. Step 10 uses the values of threshold Ti defined in Table 
25 1, below. The values for Ti are compared with the first gradient ratio. If 
r<o) < Ti defined in Table 1 the block is classified as corresponding to 
class Dk(o) and the method continues from step 12. Otherwise the 
method continues from step 1 1 . 



Orientation Relation for r fm 


T, 


r m e {r„ | k = 2,6} 


0.5 


r (0) e{r 4 |* = 1,3,5/7} 


0.4 


r (0) e{r t | = 0,4} 


0.3 



30 Table 1 



Step 1 1 
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Directionality is not yet found, therefore in step 1 1 the three smallest 
gradient ratios are checked to find out if they are neighbours and if the 
smallest gradient ratio is in the middle. In that case a still higher 
threshold value compared with the threshold value used in Step 3 can 
5 be tried for checking the directionality. This means that more uncertain 
examination is performed. Step 11 uses the values of threshold T 2 
defined in Table 2, below. Then, if the directionalities corresponding to 
the second r (1) and the third gradient ratios r (2) are the closest 
neighbours for the directionality corresponding to the first gradient ratio 
10 r (0) and r (0 ) < T 2 defined in Table 2, the block is classified as 
corresponding to class Dk<o) and the method continues from step 12. 
Otherwise the method continues from step 13. 



Orientation Relation for r f0 \ 


T 2 


r (o, e {r k | ft = 2,6} 


0.6 


r v» e K | k = 1,3,5,7} 


0.5 


r<o, e {r t |*=0,4} 


0.4 



Table 2 



15 

Step 12 

Step 12 performs a check that classification is really based on an edge 
in the image with a certain orientation rather than texture. Step 12 uses 
the values of threshold T 3 defined in Table 3, below. In Table 3 values 

20 for only two possible block sizes (8x8, 4x4) are shown, but in practical 
embodiments other block sizes can also exist, wherein respective 
values for T 3 must be defined. In step 12 the minimum gradient value 
9min = rnin{g k } is examined. Depending on the classification and the size 
of the block, the threshold T 3 is chosen from Table 3. If g min < T 3 the 

25 initial classification process terminates. Otherwise the method 
continues from step 13. 



Classification of the 
Block 


T 3 for 4x4 Block 


T 3 for 8x8 Block 


D2 and D6 


9.0 


7.0 


D1, D3, D5 and D7 


11.5 


9.0 


DO, D4 


14.0 


11.0 



Table 3 
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Step 13 

Step 13 performs a check whether texture is smooth or coarse. The 
maximum gradient value g ma xis compared with 10.0. If g max < 10.0 the 
5 block is classified as D9. Otherwise the block is classified as D10. Step 
13 is not necessarily needed, if both smooth and coarse texture are 
mapped into the same context class. 

Next the selection 21 of a suitable prediction method Is performed for 
10 the current block C. In the preferred embodiment of the invention the 
selection phase is preceded by a mapping phase. The purpose of the 
mapping is to reduce the memory consumption of the implementation. 
Some of the directionality classes can be mapped together. The 
classes resulting from the mapping phase are called as context classes 
15 and they are referred as references C1 - C6. In the preferred 
embodiment of the invention the diagonal classes are combined to two 
alternative classes, one for the bottom-left to top-right diagonally and 
the other for the top-left to bottom-right diagonality. 

20 Mild and steep diagonal classes D5, D6 and D7 are mapped to the first 
diagonal context class C4. Similarly, classes D1, D2 and D3 are 
mapped to the second diagonal context class C5. Further, the smooth 
texture class D9 and coarse texture class D10 are mapped together to 
produce texture context class C6. This mapping is illustrated in Figure 

25 4. 



In addition to the 6 context classes C1 — C6 there is one further context 
class CO used for "Non-lntra" blocks. In general, a "Non-lntra" block is a 
block that does not exist, i.e. when block C is at the image boundary. If 
30 the prediction is implemented in such a way that only intra-coded 
blocks are used as a reference, the definition of a "Non-lntra" block is 
extended to those blocks that are not intra-coded. 



In the preferred embodiment of the invention there are a total of 13 
35 different prediction methods, which are depicted in Figures 5a — 5p for 
8x8 blocks. Prediction methods for other block sizes and context 
classes can be derived similarly. In each case, prediction is performed 
in a causal manner using neighbouring reconstructed intra-coded 
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blocks L, U, UL, UR as a reference. The region used for prediction 
depends on the prediction method as depicted in Figures 3a and 3b 
where block C is the current block to be coded. In case of prediction 
methods P1 — P12, the region from which blocks may be used for 
5 prediction is the area covered by four neighbouring blocks L, UL f U and 
R as shown in Figure 3b. For prediction method P13, this region is 
larger, as depicted in Figure 3c. 

In an advantageous embodiment of the method according to the 
10 invention, a subset of prediction methods for each context class 
combination is defined and the prediction methods are prioritized 
(ranked) in each subset Then, the prediction method used to predict 
the content of the current block C is selected from a subset of 
prediction methods. The prediction methods within a subset differ from 
15 each other and correspond to those prediction methods that are most 
likely to provide an accurate prediction for block C, in the event of 
particular classifications being obtained for neighbouring blocks like L 
and U. One advantageous definition for the subsets is presented in 
Table 4 below. 

20 

Effectively, the results of context classification for the first neighbouring 
block L and second neighbouring block U are combined, /.e. both taken 
into consideration when selecting a prediction method for block C. The 
subset of prediction methods is selected from Table 4 according to the 

25 context information of the neighbouring blocks L, U. Each row of Table 
4 defines the prediction method subset for a certain pair of context 
classes for neighbouring blocks L, U and the priority (rank) of the 
prediction methods in the subset. Ranking is used to simplify for the 
context-dependent signalling of the prediction methods, as described 

30 later in this description. For example, if the first neighbouring block L is 
classified into context class C2 and the second neighbouring block U is 
classified into context class C4, the subset for this combination 
comprises prediction methods P1, P9, P5, P13, P7 and P6 (in ranking 
order). The prediction estimator 3 further selects the most appropriate 

35 prediction method from this subset, as detailed later in this description. 
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Ra 


nk of Prediction Meth< 


ids 


L Class 


U Class 


Rank 1 


Rank 2 


Ranks 


Rank 4 


RankB 


Rank 6 


CO 


CO 


P1 


P5 


P11 


P9 


P8 


P4 


C1 


P1 


P9 


P5 


P8 


P2 


P13 


C2 


P1 


P5 


P2 


P13 


P11 


P3 


C3 


P5 


Pl3 


P1 


P9 


P12 


P7 


C4 


P1 


P8 


P5 


P9 


P6 


P7 


C5 


P1 


P8 


P5 


P3 


P2 


P10 


C6 


P1 


P5 


P9 


P13 


P8 


P12 


CI 


CO 


P9 


P1 


P2 


PI 3 


P8 


P10 


C1 


P9 


P1 


P13 


P2 


P5 


P10 


C2 


P9 


PI 


P2 


P5 


P3 


P11 


C3 


P9 


P5 


PI 


P13 


P4 


P11 


C4 


P9 


P1 


P13 


P5 


P3 


P7 


C5 


P9 


P1 


P13 


P2 


P8 


P10 


C6 


P9 


PI 


P13 


P5 


P11 


P2 


C2 


CO 


P1 


E9 


P10 


P11 


P12 


P7 


CI 


P9 


P1 


P10 


P5 


P11 


P2 


C2 


P1 


P11 


P10 


P2 


P3 


P12 


C3 


P5 


P1 


P11 


P9 


P4 


P13 


C4 


P1 


P9 


P5 


P13 


P7 


P6 


C5 


P1 


P9 


P10 


P11 


P2 


P7 


C6 


P1 


P11 


P9 


P5 


P12 


P10 


C3 


CO 


P5 


P1 


P12 


P9 


P13 


P7 


C1 


P1 


P9 


P5 


P13 


P3 


P11 


C2 


P5 


P1 


P9 


P4 


P13 


P3 


C3 


P5 


PI 


P13 


P9 


P12 


P11 


C4 


P1 


P5 


P9 


P6 


P13 


P7 


C5 


Pi 


P5 


P9 


P13 


P3 


P6 


C6 


P5 


P1 


P11 


P13 


P9 


P12 


C4 


CO 


P1„ . . 


P9 


P7 


PR 


P6 


P13 


C1 


P9 


Pi 


P5 


P13 


P8 


P7 


C2 


P1 


P5 


P9 


P13 


P7 


P11 


C3 


P5 


P1 


P13 


P9 


P7 


P11 


C4 


P1 


P13 


P7 


P9 


P5 


P8 


C5 


P1 


P7 


P9 


P13 


P8 


P4 


C6 


PI 


P9 


P13 


P5 


P7 


P8 


C5 


CO 


P1 


P9 


P10 


P11 


P6 


P7 


C1 


P1 


P9 


P5 


P8 


P10 


P13 


C2 


P1 


P5 


Pi 1 


P4 


P13 


P10 


C3 


P5 


P1 


P13 


P10 


P6 


P4 


C4 


P1 


P8 


P5 


P13 


P10 


P7 


C5 


P1 


P9 


P3 


P5 


P8 


P13 


C6 


P1 


P9 


P5 


P13 


P10 


P8 


C6 


CO 


P1 


P9 


P2 


P5 


PG 


P11 


C1 


P9 


P1 


P5 


P13 


P2 


P3 


C2 


P1 


P9 


P5 


PI 3 


P2 


P11 


C3 


P5 


PI 


P9 


P13 


P12 


P11 


C4 


P1 


P9 


P5 


P10 


P7 


P13 


C5 


P1 


P9 


P13 


P2 


P5 


P7 




C6 


P1 


P9 


P5 


P13 


P11 


P12 



Table 4 



In the following, the defined prediction methods are described in more 
detail. 
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Prediction method P1 

Prediction method P1 predicts the average pixel value of block C from 
the average pixel values of blocks L, UL, U and UR. The average pixel 
5 values dL, dUL and dU of the reconstructed blocks L, UL, and U are 
calculated as the integer division defined as 



d = 



^ x=0,y=0 



/IN' 



(3) 



10 where N is the size of the block, l(x,y) represent the pixel intensity 
values and "//" denotes division with truncation to integer value. The 
average pixel value dC of block C is predicted according to following 
set of rules (which are written below in the form of pseudo-code): 



dC = dL 
dC - s(dL + dU 



1 5 if all blocks L, U and UL exist then 

if dL = dU = dUL then dC = dUL 
else if dUL = dU then dC = dL 
else if dUL = dL then dC = dU 
else if dL = dU then 
20 if chrominance prediction then 

else if | dUL - dL | < 4 then 
else dC = dL 
else if dUL < dL < dU then 
else if dUL < dU < dL then 
25 else if dU < dL < dUL then 

else if dL < dU < dUL then 
else if dL < dUL < dU OR dU < dUL < dL then 

dC = s(dL + dU 

else if blocks L and U exist thendC = (dL + dU +1) // 2 
30 else if blocks L and UL exist then dC = dL 
else if blocks U and UL exist then dC = dU 
else if block L exists then dC = dL 
else if block U exists then dC = dU 
else if block UL exists then dC = dUL 
35 else dC = p 



dUL) 



dC = dU 
dC = dL 
dC = dU 
dC = dL 



dUL) 
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where p is a value that is in the middle of the possible pixel value 
range, e.g. 128, 7T denotes division with truncation and s is a clipping 
function that restricts the values to the possible range of pixel values, 
e.g. between 0 and 255 in a system that uses an 8-bit representation of 
5 luminance/chrominance values. As a result, prediction block for C is 
filled with pixels having a constant value given by dC. 

Prediction method P2 — P4 

Prediction methods P2 through P4 predict diagonal shapes in block C 
10 by extending image details from the upper right direction into block C. 
Prediction is performed by copying reference pixel values at the 
boundaries of blocks U and UR into block C as depicted in Figures 5b, 
5c, 5d. Reference pixels that are marked in gray are connected to one 
or more predicted pixels. The connection is marked as line with dots to 
15 indicate connected predicted pixels, the value of the reference pixel is 
copied to all connected predicted pixels. 

Since one or more reference blocks might be unavailable, i.e. their 
context class may be CO, prediction is performed according to following 
20 rules. 

Rule 1 

If both blocks, U and UR, are classified into one of classes C1 - C6, 
pixel prediction is performed as shown in Figures 5b, 5c and 5d 
25 respectively. For prediction method P2, pixels without any 
corresponding reference pixel in block UR are advantageously 
allocated the value of the rightmost reference pixel in block UR. 

Rule 2 

30 If block U is classified into one of classes C1 - C6 and block UR is 
classified as CO, pixel prediction is performed as shown in Figures 5b, 
5c and 5d for pixels that have a reference pixel in block U. The rest of 
the pixels are advantageously set to the value of the pixel in the lower 
right corner of the reference block U. 

35 

Rule 3 

If block U is classified as CO, the current block C is advantageously 
filled with pixels having a constant value that is substantially in the 
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middle of the possible dynamic range of pixel values, e.g. 128 (in a 
system, that uses an 8-bit representation of luminance/chrominance 
values). 

Prediction method P5 and P9 
Prediction methods P5 and P9 predict vertical and horizontal shapes in 
the current block C by extending image details either from above or 
from the left into the current block C. Depending on the selected 
method (P5 or P9), the reference pixel values at the boundary of either 
block U or L are copied to the current block C as depicted in Figures 5e 
and 5L 



If the context class of the reference block is CO then the current block C 
is advantageously filled with pixels having a constant value that is 
15 substantially in the middle of the possible dynamic range of pixel 
values, e.g. 128 (in a system, that uses an 8-bit representation of 
luminance/chrominance values). 



Prediction method P6, P7 and P8 
20 Prediction methods P6, P7 and P8 predict diagonal shapes in the 
current block C by extending image details from the upper left direction 
into the current block C as depicted in Figures 5f, 5g and 5h. Prediction 
is performed by copying reference pixel values at the boundaries of 
blocks L, UL and U into the current block C according to following rules. 



25 



30 



Rule 1 

If all blocks L, UL and U are classified into one of classes C1 - C6, the 
pixel prediction for the current block C is performed as illustrated in 
Figures 5f r 5g and 5h. 



Rule 2 

If blocks UL and U are classified into one of classes C1 - C6 and block 
L is classified as CO, pixel prediction for the current block C is 
performed as shown in Figures 5f, 5g and 5h for those pixels of the 
35 current block C that have a reference pixel in blocks UL and L. The 
remaining pixels in the current block C are advantageously assigned 
the value of the pixel in the ower left corner of the reference pixel area 
in block UL. 
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Rule 3 

If blocks L and UL are classified into one of classes C1 - C6 and block 
U is classified as CO, pixel prediction for the current block C is 
5 performed as shown in Figures 5f, 5g and 5h for those pixels of the 
current block C that have a reference pixel in blocks L and UL The 
remaining pixels in the current block C are advantageously assigned 
the value of the pixel in the upper right corner of the reference pixel 
area in block UL. 

10 

Rule 4 

If blocks L and U are classified into one of classes C1 - C6 and block 
UL is classified as CO, pixel prediction for the current block C is 
performed as shown in Figures 5f, 5g and 5h for those pixels of the 
15 current block C that have a reference pixel in blocks L and U. Pixels 
with reference pixel in block UL are predicted as shown in Figures 5n, 
5o and 5p. In case of method P7, the predicted pixel value is the 
average of the two reference pixel values rounded to the nearest 
integer value, as indicated in Figure 5o. 

20 

Rule 5 

If block L is classified into one of classes C1 - C6 and blocks UL and U 
are classified as CO, pixel prediction for the current block C is 
performed as shown in Figures 5f, 5g and 5h for those pixels of the 
25 current block C that have a reference pixel in block L. The remaining 
pixels in the current block C are advantageously assigned the value of 
the pixel in the upper right corner of the reference pixel area in block L. 

Rule 6 

30 If block UL is classified into one of classes C1 - C6 and blocks L and U 
are classified as CO, pixel prediction for the current block C is 
performed as shown in Figures 5f, 5g and 5h for those pixels of the 
current block C that have a reference pixel in blocks UL. Pixels of the 
current block C that have a reference pixel in block L are 

35 advantageously assigned the value of the lower/left reference pixel in 
block UL, Pixels of the current block C that have a reference pixel in 
block U are assigned the value of the upper/right reference pixel in 
block UL. 
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Rule 7 

If block U is classified into one of classes C1 - C6 and blocks L and UL 
are classified as CO, pixel prediction for the current block C is 
5 performed as shown in Figures 5f, 5g and 5h for those pixels of the 
current block C that have a reference pixel in block U. The remaining 
pixels of the current block C are advantageously assigned the value of 
the pixel in the lower left comer of the reference pixel area in block U. 

10 Rule 8 

If all blocks L, UL and L are classified as CO, the current block C is 
advantageously filled with pixels having a constant value that is 
substantially in the middle of the possible dynamic range of pixel 
values, e.g. 128 (in a system, that uses an 8-bit representation of 
1 5 luminance/chrominance values). 

Prediction method P10. P11 and P12 

Prediction methods P10 through P12 predict diagonal shapes in the 
current block C by extending image details from the left into the current 
20 block C as depicted in Figures 5j, 5k and 51. Prediction is performed by 
copying reference pixel values at the boundary of blocks L into the 
current block C according to following rules. 

Rule 1 

25 If block L is classified into one of classes C1 - C6> the pixel prediction 
for the current block C Is performed as illustrated in Figures 5j, 5k and 
51. Pixels of the current block C without reference pixel in block L are 
advantageously filled with the value of the pixel in the lower right comer 
of the reference pixel area. 

30 

Rule 2 

If block L is classified as CO, the current block C is advantageously 
filled with pixels having a constant value that is substantially in the 
middle of the possible range of pixel values, e.g. 128 (in a system, that 
35 uses an 8-bit representation of luminance/chrominance values). 
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Prediction method P1 3 

Prediction method P13 predicts the content of the current block C from 
the neighbouring image content by examining if there exists a range of 
pixels having values which substantially corresponds to the pixel values 
5 of the current block C. The prediction of the current block C is 
performed by copying reconstructed pixel values from a reference block 
B that is inside a search range SR as depicted in Figure 5m. Search 
range SR is defined by lists of horizontal (x) and vertical (y) 
displacements. Each pair of horizontal displacement and corresponding 

10 vertical displacement values (x, y) defines a displacement vector 
between the coordinates of upper left corner of the current block C and 
upper left corner of the reference block B. Prediction is allowed only for 
those displacements corresponding to reference block B that is 
completely inside the reconstructed part of the frame. Examples of 

15 displacement pairs using 512 displacements for 8x8 blocks are 
presented in Tables 9a and 9b. In this example the scanning order of 
the tables is from top-left to bottom-right row by row. 

The list of allowed displacements is known to both the encoder and the 
20 decoder, allowing context-dependent signaling of the selected 
reference block location. 

There are many alternative ways to select the prediction method from a 
subset of prediction methods. For example, a cost function can be 

25 defined in order to evaluate the effectiveness of the different prediction 
methods of the subset to be used. The cost function maybe calculated 
on the basis of information concerning the error incurred when 
predicting a current block C using a particular prediction method. This 
error denotes differences between actual pixel values and 

30 reconstructed pixel values. Typically, the error values for each pixel on 
the current block C are squared and summed together to produce a 
squared error measure for the whole block. The cost function may also 
comprise information concerning the number of bits, i.e. the bit rate 
needed to transfer the information to the receiver. The elements of the 

35 cost function, particularly the bit rate, can also be weighted to 
emphasize them. One example of the cost function is 



Cx = D + \R, 
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where cost Cx is defined as a weighted sum of distortion D and rate R 
associated with each of the prediction methods and A. is the weighting 
factor. If the transmission system is band limited, the weight value is 
5 typically larger than if the bandwidth is wider. Now, the values for the 
formula (4) are calculated with different prediction methods and 
preferably that one is selected which results the smallest value for the 
cost function. 

10 Additionally, the prediction error information can also be coded prior to 
transmission to the receiver. Advantageously, there is a subset of 
coding methods defined for each prediction method. Specifically, the 
coding method could be chosen to minimise the number of bits required 
to encode the prediction error. For example, the effectiveness (bit rate) 

15 of the coding method is examined. 

If the prediction error is relatively small, it may not be necessary to 
transmit the prediction error information at all. 

20 When the suitable prediction method is selected, the prediction 
estimator 3 performs spatial prediction 22 according to the selected 
prediction method. The prediction estimator 3 directs the reconstructed 
block to summer 4 where the reconstructed block is subtracted from the 
actual contents of the current frame C to produce prediction error 

25 information for the current block. 

The encoder 1 sends 23 the information about the selected prediction 
method to the multiplexer 9, which is followed by displacement 
information if method P13 is used. Encoding of the information is 
30 advantageously performed using variable length coding. 

The information is further transmitted to the receiver 10, where the 
demultiplexer 1 1 demultiplexes the received information. In the receiver 
10 the prediction information is directed to the predictor 16. The 
35 receiver 10 comprises also a frame memory 14, where the previously 
reconstructed blocks are saved. When a new coded block arrives to the 
receiver, the predictor 16 performs the classifying steps for the 
neighbouring blocks U, L of the received, current block C. Then the 
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predictor 16 carries out the mapping of classification information into 
context classes C1 - C6. After that the predictor 16 examines also the 
rank of the prediction method. The receiver 10 contains the information 
of the Table 4 and 5, wherein the predictor 16 can determine the 
5 correct subset of the prediction methods according to the context class 
combination and the rank. 

When the prediction method is discovered, the predictor 16 can 
reconstruct the current block C and save it to the frame memory 14. In 
10 a situation where also prediction error information is received, that 
information is first decoded in the decoder 12, if necessary, and 
combined with the pixel values of the reconstructed block C. Now the 
current block C is ready to direct to the output 15 of the receiver. 

15 If the prediction method of the current block C is P13, the 
reconstruction of the current block C is performed in a slightly different 
manner. The receiver 10 has to decode also the displacement 
information, wherein the displacement information is used to copy the 
pixel values of the current block C from previously reconstructed pixel 

20 values in the frame memory 14. 

Signalling of the prediction method is advantageously based on the 
context-dependent codes defined in Table 5. After selecting the 
appropriate prediction method, the encoder 1 sends a variable length 

25 codeword that corresponds to the rank of the selected prediction 
method in the context-dependent subset. Advantageous examples of 
variable length codewords representing each prediction method rank 
are listed in Table 5. For example, if the first neighbouring block L is 
classified into context class C3 and the second neighbouring block U is 

30 classified into context class C1, and the prediction method P9 is 
selected from the subset of the prediction methods for this combination, 
the respective rank is 2. Then, the codeword which corresponds this 
rank is "01". 
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Rank 


Code 


Length 


1 


1 


1 


2 


01 


2 


3 


0000 


4 


4 


0001 


4 


5 


0010 


4 


6 


0011 


4 



Table 5 



The receiver 10 is aware of the contents of Table 4, i.e. it knows which 
prediction method corresponds to each of the ranks in every possible 
5 context (combination of classes for the neighbouring blocks L and U). 
Since the receiver 10 can derive the same context information as the 
prediction estimator 3, receiver 10 can associate the rank represented 
by the received codeword to correct prediction method and perform the 
spatial prediction for block C according to the method. 

10 

In an advantageous embodiment of the invention the signalling of 
horizontal and vertical displacements associated with prediction method 
P13 is performed as follows: 

15 Step 1 

Those pairs of horizontal and vertical displacements (X(i), Y(i)) that 
correspond to reference blocks B lying partially or entirely outside the 
frame are eliminated from the ordered list given in Tables 9a, 9b. The 
number of valid pairs is denoted by Nv and the ordered list of valid pairs 
20 which are retained after the elimination is denoted by Lv. 

Step 2 

The rank r (which is one of 1, 2, .... Nv) corresponding to the chosen 
block B within the list Lv created in Step 1 is calculated. 

25 

Step 3 

Based on the value of rank r determined in Step 1 the value index! is 
calculated according to Table 6 is calculated. 
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Step 4 

The value index 2 = r - OffsetLow(index 1 ) is calculated using the values 
listed in Table 6. 



Range for rank r 




OffsetLow 


OffsetHigh 


A ux Length 








(index,) 


(index,) 


(index,) 


1. . 


...2 


1 


1 


2 


1 


3, . 


...4 


2 


3 


4 


1 


5, - 


...6 


3 


5 


6 


1 


7, . 


..,8 


4 


7 


8 


1 


9, 


12 


5 


9 


12 


2 


13, . 


... 16 


6 


13 


16 


2 


17, . 


..,24 


7 


17 


24 


3 


25, - 


.-,32 


8 


25 


32 


3 


33, . 


...48 


9 


33 


48 


4 


49,. 


...64 


10 


49 


64 


4 


65, . 


.., 96 


11 


65 


96 


5 


97, 


128 


12 


97 


128 


5 


129. . 


... 192 


13 


129 


192 


6 


193, . 


.., 256 


14 


193 


256 


6 


257, . 


384 


15 


257 


384 


7 


385, - 


...512 


16 


385 


512 


7 



5 Table 6 



Step 5 

Next, a variable bits is calculated as follows. If Nv < OffsetHigh(index 1 ), 
the value for the variable bits is computed advantageously with formula 
10 bits = [log2(1 + Nv - OffsetLow(indexl))], where [x] denotes the nearest 
integer > x. Otherwise, bits = AuxLength(indexl). 

Step 6 

Depending on the value of Nv the variable whose sub-script is indexi is 
15 encoded using the corresponding Variable Length Coding given in 
Table 7 and Table 8. This codeword is transmitted to the decoder, 
which is illustrated with block CW1 in Figure 6. 

Step 7 

20 If the variable bits is nonzero the binary representation of index2 is 
encoded using a number of bits corresponding to the value of variable 
bits and this codeword is transmitted to the receiver, which is illustrated 
with block CW2 in Figure 6. 
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Since the decoder can derive the ordered list of valid displacement 
vectors, it can associate the rank represented by the received 
codeword with the correct displacement vector. 

5 The block carrying out prediction method according to the invention is 
particularly advantageously implemented in a digital signal processor or 
a corresponding general purpose device suited to processing digital 
signals, which can be programmed to apply predetermined processing 
functions to signals received as input data. The measures according to 
10 the invention can be carried out in a separate signal processor or they 
can be part of the operation of such a signal processor which also 
contains other arrangements for signal processing. 

A storage media can be used for storing a software program comprising 
15 machine executable steps for performing the method according to the 
invention. Then in an advantageous embodiment of the invention the 
software program can be read from the storage media to a device com- 
prising programmable means, e.g. a processor, for performing the 
method of the invention. 

20 

A mobile terminal 24 intended for use as a portable teleconferencing 
device and applying the deblocking filter method according to the 
invention comprises advantageously at least display means 25 for dis- 
playing images, audio means 26 for audio information, keyboard 27 for 
25 inputting e.g. user commands, radio part 28 for communicating with 
mobile network, processing means 29 for controlling the operation of 
the device, memory means 30 for storing information, and preferably a 
camera 31 for taking images. 

30 The present invention is not solely restricted to the above presented 
embodiments, but it can be modified within the scope of the appended 
claims. 



21/01 00 19:32 FAX 03 2886262 



TAMPEREEN PATENT 



[&032 



31 



Claims: 



1 . A method for transmitting digital images, in which method at least 
one frame is formed from the digital image, and the frame is divided 

5 into blocks (C, L, U, UL, UR), characterized in that in the method a 
spatial prediction for a block (C) is performed to reduce the amount of 
information to be transmitted, wherein at least one prediction method 
(P1 — P13) is defined, a classification is determined for at least one 
neighbouring block (L, U) of said block (C) to be predicted according to 
10 the contents of said neighbouring block (L, U), and a prediction method 
(P1— P13) is selected for the current block (C) on the basis of at least 
one said classification. 

2. A method according to Claim 1, characterized in that the 
classification is determined on the basis of directionality information of 

15 the block. 

3. A method according to Claim 2, characterized in that the 
directionality information of the block is determined by calculating at lest 
one gradient value (g k ) on the basis of pixel values of said block. 
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4. A method according to Claim 3, characterized in that the gradient 
values (gO are calculated with the following formula 
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(1) 



where N is the size of the block, l(x,y) represent the pixel intensity 
5 values, indices x and y refer to coordinates of pixel inside the block, 
and k represents edge orientations. 

5. A method according to Claim 4, characterized in that at least 
eight directionality classes (DO - D7) are defined for different edge 
orientations. 

10 6. A method according to any of the Claim 1 to 5, characterized in 
that the classification comprises further 3 non-directional classes (D8 - 
D10) corresponding to flat, smooth texture and coarse texture blocks. 

7, A method according to any of the Claim 1 to 6, characterized in 
that in the method at least two context classes (CO - C6) are defined, 
15 therein a mapping phase is performed, in which the classification 
information (D8 - D10) is mapped into one of said context classes (CO 
-C6). 
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8. A method according to any of the Claim 1 to 7, characterized in 
that in the method a classification is determined for two neighbouring 
blocks (L, U) of said block (C) to be predicted according to the contents 
of said neighbouring blocks (L, U), context classes (CO - C6) are 

5 defined for said neighbouring blocks (L f U), and a prediction method 
(P1 — P13) is selected for the current block (C) on the basis of a 
combination of the defined context classes (CO - C6). 

9. A method according to any of the Claim 1 to 8, characterized in 
that in the method a cost function is defined, wherein the selection of 

10 the prediction method comprises the steps of: 

- calculating a value of the cost function for at least two prediction 
methods, 

- exploring the calculated cost function values to finding the minimum 
value, and 

15 - selecting the prediction method which produces said minimum value 
for the cost function. 

10. A method according to Claim 9, characterized in that the cost 
function is defined as 

20 Cx = D + \R, 

where cost Cx is defined as a weighted sum of distortion D and rate R 
associated with each of the prediction methods and X is the weighting 
factor. 

25 11. A method according to any of the Claim 1 to 10, characterized in 
that in the method a prediction error is defined on the basis of the 
predicted block and the real pixel values of said block (C), and that the 
prediction error information is coded, and the coded prediction error 
information is transmitted- 

30 12. A device for transmitting frames, which have been formed from 
digital images, and the frames are divided into blocks (C, L, U, UL, UR), 
characterized in that the device comprises means for performing 
spatial prediction for a block (C) to reduce the amount of information to 
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be transmitted, wherein at least one prediction method (P1 — P13) has 
been defined, that the device further comprises means for determining 
a classification for at least one neighbouring block (L, U) of said block 
(C) to be predicted according to the contents of said neighbouring block 
5. (L t U), and means for selecting a prediction method (P1 — P13) for the 
current block (C) on the basis of at least one said classification. 

13. A device according to Claim 12, characterized in that the means 
for determining classification comprises means for determining 
directionality information of the block. 

10 14, A device according to Claim 13, characterized in that the means 
for determining directionality information comprises means for 
calculating at lest one gradient value (g k ) on the basis of pixel values of 
said block. 
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15. A device according to Claim 14, characterized in that the gradient 
values (g k ) have been calculated with the following formula 
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where N is the size of the block, l(x,y) represent the pixel intensity 
5 values, indices x and y refer to coordinates of pixel inside the block, 
and k represents edge orientations. 

16. A device according to Claim 15, characterized in that at least 
eight directionality classes (DO - D7) have been defined for different 
edge orientations. 

10 17. A device according to any of the Claim 12 to 16, characterized in 
that the classification comprises further 3 non-directional classes (D8 - 
D10) corresponding to flat, smooth texture and coarse texture blocks. 

18. A device according to any of the Claim 12 to 17, characterized in 
that at least two context classes (CO - C6) have been defined, therein 
15 the device comprises means for performing a mapping phase, in which 
the classification information (D8 - D10) is arranged to be mapped into 
one of said context classes (CO - C6). 
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19. A device according to any of the Claim 12 to 18, characterized in 
that the device comprises means for performing classification for two 
neighbouring blocks (L, U) of said block (C) to be predicted according 
to the contents of said neighbouring blocks (L, U), means for defining 

5 context classes (CO - C6) for said neighbouring blocks (L f U), and 
means for selecting a prediction method (P1 — P13) for the current 
block (C) on the basis of a combination of the defined context classes 
(CO - C6). 

20. A device according to any of the Claim 12 to 19, characterized in 
1 0 that a cost function has been defined, wherein means for selecting a 

prediction method (P1— P13) comprises means for: 

- calculating a value of the cost function for at least two prediction 
methods, 

- exploring the calculated cost function values to finding the minimum 
15 value, and 

- selecting the prediction method which produces said minimum value 
for the cost function. 

21. A method according to Claim 20, characterized in that the cost 
function has been defined as 

20 

Cx = D + ^R, 

where cost Cx has been defined as a weighted sum of distortion D and 
rate R associated with each of the prediction methods and X is the 
25 weighting factor. 

22. A device according to any of the Claim 12 to 21. characterized in 
that the device comprises means for defining a prediction error on the 
basis of the predicted block and the real pixel values of said block (C), 
means for coding the prediction error information, and means for 

30 transmitting the coded prediction error information. 

23. An encoder (1) comprising means for forming frames from digital 
images, and means for dividing the frames into blocks (C, L, U, UL, 
UR), characterized in that the encoder (1) comprises means for 
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performing spatial prediction for a block (C) to reduce the amount of 
information to be transmitted, wherein at least one prediction method 
(P1— P13) has been defined, that the encoder (1) further comprises 
means for determining a classification for at least one neighbouring 
5 block (L, U) of said block (C) to be predicted according to the contents 
of said neighbouring block (L, U), and means for selecting a prediction 
method (P1— P13) for the current block (C) on the basis of at least one 
said classification. 

24. A decoder (10) comprising means for receiving frames, which 
10 have been formed from digital images, and the frames are divided into 
blocks (C, L, U, UL, UR), characterized in that the decoder (10) 
comprises means for performing spatial prediction for a block (C) to 
reduce the amount of information to be transmitted, wherein at least 
one prediction method (P1— P13) has been defined, that the decoder 
15 (10) further comprises means for determining a classification for at 
least one neighbouring block (L, U) of said block (C) to be predicted 
according to the contents of said neighbouring block (L, U), and means 
for selecting a prediction method (P1— P13) for the current block (C) on 
the basis of at least one said classification. 

20 25. A codec (1, 10) comprising means for forming frames from digital 
images, means for dividing the frames into blocks (C, L, U, UL, UR), 
and means for receiving frames, characterized in that the codec (1 , 
10) comprises means for performing spatial prediction for a block (C) to 
reduce the amount of information to be transmitted, wherein at least 

25 one prediction method (P1— P13) has been defined, that the codec (1, 
10) further comprises means for determining a classification for at least 
one neighbouring block (L, U) of said block (C) to be predicted 
according to the contents of said neighbouring block (L, U), and means 
for selecting a prediction method (P1— P13) for the current block (C) on 

30 the basis of at least one said classification. 

26. A mobile terminal (24) comprising means for forming frames from 
digital images, means for dividing the frames into blocks (C, L, U, UL, 
UR), and means for receiving frames, characterized in that the mobile 
terminal (24) comprises means for performing spatial prediction for a 
35 block (C) to reduce the amount of information to be transmitted, 
wherein at least one prediction method (P1— P13) has been defined, 
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that the mobile terminal (24) further comprises means for determining a 
classification for at least one neighbouring block (L, U) of said block (C) 
to be predicted according to the contents of said neighbouring block (L, 
U), and means for selecting a prediction method (P1 — P13) for the 
5 current block (C) on the basis of at least one said classification. 

27. A storage media for storing a software program comprising 
machine executable steps for forming frames from digital images, and 
for dividing the frames into blocks (C, L, U, UL, UR), characterized in 
that the software program further comprises machine executable steps 

10 for performing spatial prediction for a block (C) to reduce the amount of 
information to be transmitted, wherein at least one prediction method 
( p 1 — P13) has been defined, steps for determining a classification for 
at least one neighbouring block (L, U) of said block (C) to be predicted 
according to the contents of said neighbouring block (L, U), and steps 

1 5 for selecting a prediction method (P1 — P1 3) for the current block (C) on 
the basis of at least one said classification. 
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Abstract: 

The invention relates to a method for transmitting digital 
images, in which method at least one frame Is formed 
from the digital image, and the frame is divided into 
blocks (C, L, U, UL, UR). In the method a spatial 
prediction for a block (C) is performed to reduce the 
amount of information to be transmitted, wherein at least 
one prediction method (P1 — P13) is defined. In the 
method a classification is determined for at least one 
neighbouring block (L, U) of said block (C) to be 
predicted according to the contents of said neighbouring 
block (L, U), and a prediction method (P1—P13) is 
selected for the current block (C) on the basis of at least 
one said classification. 



Fig. 2 
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